Temporal control and training selection for HMM-based system
نویسندگان
چکیده
Most speaker-independent acoustic-phonetic decoding systems are based on hidden Markov models. Such systems lack a real temporal control for the phonetic models. Furthermore, inter-speaker variability makes speaker adaptation necessary. In order to solve these problems, we introduce two original approaches. On the one hand, discontinuities detected with the ForwardBackward Divergence method are used to constrain phonetic transitions and to perform a more accurate temporal control. On the other hand, an efficient interspeaker measure, based on AR-vector models, allows the selection of a speaker neighbourhood and the adaptation of the phonetic models. The contribution of these two methods is estimated on the TIMIT database.
منابع مشابه
A New Fast and Efficient HMM-Based Face Recognition System Using a 7-State HMM Along With SVD Coefficients
In this paper, a new Hidden Markov Model (HMM)-based face recognition system is proposed. As a novel point despite of five-state HMM used in pervious researches, we used 7-state HMM to cover more details. Indeed we add two new face regions, eyebrows and chin, to the model. As another novel point, we used a small number of quantized Singular Values Decomposition (SVD) coefficients as feature...
متن کاملMAN-MACHINE INTERACTION SYSTEM FOR SUBJECT INDEPENDENT SIGN LANGUAGE RECOGNITION USING FUZZY HIDDEN MARKOV MODEL
Sign language recognition has spawned more and more interest in human–computer interaction society. The major challenge that SLR recognition faces now is developing methods that will scale well with increasing vocabulary size with a limited set of training data for the signer independent application. The automatic SLR based on hidden Markov models (HMMs) is very sensitive to gesture's shape inf...
متن کاملSpeech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملMixture splitting technic and temporal control in a HMM-based recognition system
In this paper, we study various technics to improve the performance, to reduce the computation cost and the required memory of a recognition system based on HMM. For the efficiency of the system, we first study the optimization of the number of HMM parameters according to training data. We experiment a temporal control of the phonetic transitions on lexical decoding task with a significant 5% i...
متن کاملA Bayesian Approach to Temporal Data Clustering using Hidden Markov Models
This paper presents clustering techniques that partition temporal data into homogeneous groups, and constructs state based proles for each group in the hidden Markov model (HMM) framework. We propose a Bayesian HMM clustering methodology that improves upon existing HMM clustering by incorporating HMM model size selection into clustering control structure to derive better cluster models and part...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995